Tweedie's Formula and Selection Bias.
Author
Abstract
We suppose that the statistician observes some large number of estimates z(i), each with its own unobserved expectation parameter μ(i). The largest few of the z(i)'s are likely to substantially overestimate their corresponding μ(i)'s, this being an example of selection bias, or regression to the mean. Tweedie's formula, first reported by Robbins in 1956, offers a simple empirical Bayes approach for correcting selection bias. This paper investigates its merits and limitations. In addition to the methodology, Tweedie's formula raises more general questions concerning empirical Bayes theory, discussed here as "relevance" and "empirical Bayes information." There is a close connection between applications of the formula and James-Stein estimation.
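As a rough illustration of the correction described in the abstract, the sketch below applies Tweedie's formula in the normal case: if z | μ ~ N(μ, σ²) and f(z) is the marginal density of z, then E[μ | z] = z + σ² (d/dz) log f(z). The empirical Bayes step estimates log f(z) from the observed z(i)'s themselves. Everything specific here is an assumption for illustration and is not taken from the paper: the function name `tweedie_correct`, the simulated N(0, 1) prior, σ = 1, and the cubic polynomial fitted to log histogram counts by weighted least squares, which is a simplified stand-in for a proper density estimate.

```python
import numpy as np


def tweedie_correct(z, sigma=1.0, degree=3, bin_width=0.2):
    """Empirical Bayes estimate of E[mu | z] via Tweedie's formula.

    Assumes z_i | mu_i ~ N(mu_i, sigma^2). The log marginal density is
    approximated (illustrative choice) by a polynomial fitted to the log of
    histogram counts; only its derivative enters the formula.
    """
    z = np.asarray(z, dtype=float)

    # Histogram the observations on a regular grid that covers all of them.
    edges = np.arange(z.min() - bin_width, z.max() + 2 * bin_width, bin_width)
    counts, edges = np.histogram(z, bins=edges)
    centers = 0.5 * (edges[:-1] + edges[1:])

    # log(count) equals log f(z) up to an additive constant, which does not
    # affect the derivative. Empty bins are dropped; sqrt(count) weights
    # reflect that log(count) has variance of roughly 1 / count.
    keep = counts > 0
    coefs = np.polyfit(
        centers[keep],
        np.log(counts[keep]),
        deg=degree,
        w=np.sqrt(counts[keep]),
    )

    # Tweedie's formula: z + sigma^2 * d/dz log f(z).
    slope = np.polyval(np.polyder(coefs), z)
    return z + sigma**2 * slope


# Simulated example (assumed setup): mu_i ~ N(0, 1), z_i = mu_i + N(0, 1).
rng = np.random.default_rng(0)
n = 5000
mu = rng.normal(0.0, 1.0, size=n)
z = mu + rng.normal(0.0, 1.0, size=n)

mu_hat = tweedie_correct(z)

# Compare raw and corrected estimates on the 20 largest observations.
top = np.argsort(z)[-20:]
raw_bias = np.mean(z[top] - mu[top])
corrected_bias = np.mean(mu_hat[top] - mu[top])
print(f"mean overshoot of raw z (top 20):       {raw_bias:.2f}")
print(f"mean overshoot after correction (top 20): {corrected_bias:.2f}")
```

In this simulated setting the 20 largest z(i)'s overshoot their μ(i)'s noticeably, while the corrected values should sit much closer to them; the exact numbers depend on the seed and on the bin width and polynomial degree chosen.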
Similar resources
Selection Bias in reporting the prevalence of Transfusion Transmitted Infection Diseases in Iranian Hemophiliacs
Efficacy of cognitive-behavioural therapy and other psychological treatments for adult depression: meta-analytic study of publication bias.
BACKGROUND: It is not clear whether the effects of cognitive-behavioural therapy and other psychotherapies have been overestimated because of publication bias. AIMS: To examine indicators of publication bias in randomised controlled trials of psychotherapy for adult depression. METHOD: We examined effect sizes of 117 trials with 175 comparisons between psychotherapy and control conditions. As ...
Model Selection in Classification: the Swapping Method
In this article, the bias of the empirical error rate in supervised classification is studied. The exact formula and a robust estimator of the bias are given. From these results, we propose a new penalized criterion to perform model selection in classification. Applications to simulated and real data are presented.
Estimating Gene Expression and Codon-Specific Translational Efficiencies, Mutation Biases, and Selection Coefficients from Genomic Data Alone‡
Extracting biologically meaningful information from the continuing flood of genomic data is a major challenge in the life sciences. Codon usage bias (CUB) is a general feature of most genomes and is thought to reflect the effects of both natural selection for efficient translation and mutation bias. Here we present a mechanistically interpretable, Bayesian model (ribosome overhead costs Stochas...
The Accuracy and Bias of Single-Step Genomic Prediction for Populations Under Selection
In single-step analyses, missing genotypes are explicitly or implicitly imputed, and this requires centering the observed genotypes using the means of the unselected founders. If genotypes are only available for selected individuals, centering on the unselected founder mean is not straightforward. Here, computer simulation is used to study an alternative analysis that does not require centering...
Journal title:
- Journal of the American Statistical Association
Volume 106, Issue 496
Pages -
Publication date: 2011